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SUBJ: Analysis and Recommendations regarding the Clusters Program 



Multi-computer systems, including those comprised of Personal Computers, 
are becoming more prevalent in the marketplace. Increased requirements 
for reliability are just one of the reasons for this trend. w-« can no 
longer concentrate solely on engineering, manufacturing, selling, and 
servicing single computer systems. 

In the future, there win be less differentiation between the hardware 
and software of various computer vendors. Today's innovation will be a 
commodity tomorrow. Producing quality hardware and software will always 
be important* but we can no longer survive in the emerging markets 
simply by producing the best hardware and software. 

The CI Cluster Program provides Digital with opportunity to gain 
valuable experience in the multi-computer space* 

we must set up a structure wnich allows us to become a leader in the 
systems integration business. The vendor who Is capable of properly 
characterizing, installing t and servicing their multi-computer systems 
win likely be the vendor of choice. 

The CI Cluster Architecture provides significant technical innovation 
for Digital, we should not lose sales based on the functionality of the 
CI Cluster. Neither should we expect to win sales based solely on the 
functionality of the CI Cluster. Systems analysis and service 
capabilities tuned to the CI Cluster hardware and software win be of 
great importance. 

The Challenge over the next year will be to ensure that we set up a 
structure to allow us fully leverage the Cluster Architecture. CI 
Clusters will be the first complete multi-ccmouter systems Dlaitai 
delivers to the market. This will allow us to gain valuable experience 
in the installation and servicing of multi-computer systems. This 
expertise win be necessary to survive in the newly emergina computing 
markets. 



Clusters Analysis and Recommendations 



The ^a r ket 



'High Availability* is NOT the market for multi-computer systems. 
'Nonstop' will no lonaer be a point of differentiation as most vendors 
add fault tolerance to their products (the commodity effect!). 

Enhanced availability is becoming increasingly important in the aeneral 
purpose computer markets where we do most of our current business. In 
addition there is also a large demand for the ability to easily increase 
the capacity of a system. This is where the multi-computer cluster 
architecture is most applicable. 

The newly emerging markets demandina "systems' defined Dy personal 
computers and local area networks require a coherent multi-comouter 
architecture. This market is commonly referred to as the 'Office'. CI 
Clusters closely match the needs of the traditional general purpose 
market. The technologies, such as shared data bases, developed for the 
CI Cluster program should be transportable to the emeroing 'system' 
architecture needed for the office. 



The Cemoetition 



Tandem Computers will rely less on their 'Nonstop' architecture to sell 
systems in the future. They see the newly emerging market and will try 
to position themselves to be qualified in this space. They will use 
Distributed Data Base caoabillties. Mail and Transaction Processing 
software, and Satellite Communications as their new levers into the 
account. 

IBM will introduce products to compete in every market. They have 
stated their intentions to introduce fault tolerant extensions to one of 
their mainstream architectures. In most cases, IBM's hardware and 
software win not be the best available. The major threat from IBM is 
their potential (emphasize POTENTIAL) to be the best systems integrator 
(orimarily of their own gear) in the industry. They have not always 
demonstrated aptitude tor this, but they may view it as a strategic 
element in the near future. 

stratus, August Systems, and Intel are introducing multi-computer 
products into a variety of marketplaces. They are mentioned here to 
allow us to assess the impact of some of the new technoloaies. wone of 
these 'vendors is an immediate threat to us, but their methods deserve 
watching . 



Digital's Position 



The CI Cluster architecture will make us competitive with any vendor now 
competing for general purpose applications with needs for enhanced 
availability or system capacity. 



Clusters Analysis and Recommendations 



WE SHOULD NCT LOSE SALES BASED ON THE 'FUNCTIONALiTY OF THE Gl CLUSTER. 
It is conceivable that we could lose on price for the lower end systems, 
we might also be vulnerable to price/performance attacks in the nigner 
end configurations between now and the introduction of our next 
generation of VAX processors. 

In order to properly leverage our efforts in the multi-computer space I 
recommend the following: 

1. Agressively introduce the CI Cluster Program to the market with a 
program announcement (FALL 82) and product announcements CSPHING 
83). The motivation behind the program announcement is to gain 
visibility in the market at the earliest possible time, we must not 
allow any further competition to gain footholds in application areas 
of importance to us. 

2. Explore the possibility of providing a subset of CI Cluster 
functions on the N'l. 

3. Extend the 'systems oriented* programs recently intitiated in 32 bit 
engineering. «e need to be able to fully characterize, install* 
maintain, and refine our multi-computer offerings. We must develop 
the tools and expertise to configure, test, monitor, and tune thesfe 
multi-computer systems. I 

4. Provide services Cooth remedial and consulting) which complement the 
hardware and software being produced in the CI Cluster program. 
These services must be built upon the knowledge gathered in the 
systems programs initiated in engineering. Our service organization 
has a mass which can be used as a significant advantage over smaller 
vendors. 

5. Start building a second generation of high speed computer 
interconnects. Serious thought should be given to merging the HI 
and CI programs. This effort would produce better price/performance 
and also avoid oroduct confusion similar to that which exists today 
with the UNIBUS/QBUS. 

6. Initiate programs to move the technology developed for the CI 
Clusters program Cie., shared data bases, etc.) to our other 
multi-computer programs Cie., Personal Computers, LANS, etc.). Tne 
hardware/software/service technology established by the CI Clusters 
prooram will make our other multi-computer architectures more 
viable. 

7. Fully quantify tne impact of the Ulysses communications switch. If 
it is perceived to be deficient In any way, we must be prepared to 
fill the holes with other available solutions. 

8. The VAX Information Architecture provides Digital with powerful 
tools for both the general purpose market and newly emerging office 
markets, we must insure that VIA takes full advantage of new base 
VMS functionality produced as a result of the CI Cluster Program. 

The attached DRAFT document goes into more detail for each of the points 
discussed above. Please get back to me with your comments. 
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what is a Cluster? 



The purpose of this paper is to clarify the concept of Clusters, 
accurately convey what we are building and the implied benefits, and to 
highlight the strategic importance of the technologies and methodologies 
associated with the Clustered System Program, 

A CLUSTER is a group of cooperating COMPUTERS connected through a HIGH 
SPEED bus or link, CLUSTERS normally provide two important benefits as 
opposed to single computer systems. The first benefit is survivability. 
The second is modular expandability. Some of the identifyina 
characteristics of a CLUSTER are: 

1. The COMPUTERS are usually independent. Each COMPUTER has its own 
memory and its own copy of the operating system. Failure of one of 
the COMPUTERS in a CLUSTER should not affect the others. This 
topology is generally referred to as LOOSELY COUPLED 
MULTIPROCESSING. 

2. The CLUSTER is utilized in much the same fashion as a single 
COMPUTER as far as the users are concerned. Rarely are the 
COMPUTERS in a CLUSTER exoected to be secured or Protected from one 
another. For this reason, COMPUTERS in a CLUSTER are usually 
located in close proximity to one another. operational 
responsibility for the entire CLUSTER usually falls within a single 
organization. 

i 
i 

3. For the short term, the other reason why a CLUSTER has a restricted 
radius is the need for a HIGH SPEED bus or link. Perhaps, in the 
future, communications technology will deliver HIGH SPEED, lono 
distance links. HIGH SPEED can probably be defined as no less than 
1 megabyte/second. 

4. The COMPUTERS in a CLUSTER usually share a common file system or 
data base. 

5. Because of the shared file system, users of a CLUSTER usually do not 
have a preference for which COMPUTER they connect to. Therefore, 
flexible communications switches or patches are usually used in 
conjunction with CLUSTERS, 

6. The CLUSTER can be viewed as a single, larger, more dependable, more 
functional system than any of the component COMPUTERS, The shared 
resources and communications switching make the CLUSTER appear this 
way. 

7. while we normally view clustering as being applied to laraer 
computers (minicomputers are considered large these days), the same 
concepts can be applied to smaller computers, such as personal 
COMPUTERS, when PERSONAL COMPUTERS are clustered, no communications 
switches or patches are necessary, since the computer is also the 
terminal. Instead, Local ftrea Networks provide connectivity to all 
desired resources. 
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The Market 



when we view the market for clustered systems , there is always a 
temptation to equate it with the so-called *KIGH AVAILABILITY * market. 
Let us define the market and market size for clustered systems by 
breaking it into 3 segments. They are: 

1. The High Availability Market 

2. The General Purpose Computing Market 

3. The Emerging Market 

A sketch of each market and estimated market size follows, 

1. The High Availability Market 

This is a small market segment whose main identifying characteristic 
is the need to maintain operation nearly 1*00% of the time and to 
recover f row any failure within about a 1 second timeframe. Example 
applications in the High Availability Market are: 

1. Nuclear Power Monitoring and Control 

2. Air Traffic Control 

3. Space Flight 

4. Some Military Defense Applications 

These applications are characterized by the following: 

1» Need tor totally redundant hardware, with little concern for 
cost. This hardware is required to have f ailover times of less 
than 1 second in most cases and 'milliseconds* in some cases. 

2. Need to support special process interfaces. 

3. High liability if the 'failsafe* system fails. 

Approximate market size: Less than 1% of all Data Processing 
Revenues. 

Digital has a policy about such applications. We do not bid for 
them. 

2. The General Purpose Computing Market 

When most applications were automated for the first time, there was 
usually a manual backup system which could be invoked in the event 
of a failure. The main motivation for automating the application 
the first time was cost savings. Therefore, 'failures were a 
nuisance, but did not necessarily jeopardize the business. Payroll 
is an application with these characteristics. 
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Today, many applications .make significant contributions to tne 
profitability of a business. In some cases the -computer 
applications are tne reason tne firm can compete. If tne 
applications go down, the business is directly affected, A cas h 
management/funds transfer system at a bank is an application with 
these cnaracteristics . 

As applications go through their second Generation of automation, 
there is rarely a manual fallback system. 

Reliable computing is becoming a prerequisite for the sale of 
computer systems. This does not insply that more applications are 
becoming 'HIGH AVAILABILITY ' applications as defined in the previous 
section, but rather that all applications need some level of clean 
recovery. 

we might call this facility 'PREDICTABLE RECOVERY'. For most 
general purpose applications, a small amount of downtime is not 
critical, what is critical is the ability to ensure that there be 
no loss of data, no corrupted data, and some facility to restore the 
computing resource in a timeframe selected by the user. 

Just as important as reliable operation is the ability to easily 
expand tne capacity of the system. This is one of the messages 
which we have always used for DDP, the Clustered System Architecture 
magnifies this message. 

The requirements for reliable computing in the general purpose 
computing market are as follows: 

1, Total Data Integrity. Protection against hardware destruction 
and software pollution of data. 

2. Provisions for providing failover of hardware components. These 
facilities can oe manual or automated. The customer must be 
given tne cnoice of automated failover however. 

3. Comprehensive services, 

4, The ability to accomodate growth of the application without 
jeopardizing tne user's current investment. 

Approximate Market Size: 80% of all Data Processing Revenues. 

The Clustered System Approach provides basic building blocks for 
both reliability and expansion. This manifestation of DDP could be 
considered a complete alternative to mainframe processing. 

3. The Emerging Market 

Oyer the last two years the computer industry has gone through an 
upheaval. Office Automation has become the newest and biggest 
buzzword in the industry. Personal computers nave earned respect 
and have been 'blessed' by the two largest computer companies. 
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Office Automation and tne increasing emphasis or. personal computers 
have highlighted the need for nodularity in comcuter systems. Local 
Area Network technology has given us hope that we can put together 
modular systems with personal computers in the Office environment 
and in other applicable environments. 

The Emerging Market will attempt to utilize new technology to 
increase the overall productivity of their business entities, 
especially the office environment. The reason that the market must 
be labelled 'Emerging* is that there is no set definition for it 
yet. Every vendor entering this arena has their own definition 
concerning which technologies are Key (usually emphasizing the 
technologies they have available at the time). 

A few common threads run through the various definitions of the 
Emerging Market however. The technologies that seem to be required 
are: 

1. Personal Computers 

2. Local Area Networks 

3. General Purpose Computers 

4. Flex^ole Communications 

5. Data Management Call data types, managed across networks) 

Any vendor meeting all of the above requirements should fare well. 
However, the requirement for system integration is crucial in this 
market segment. 

The successful vendor will supply the components listed above and 
also serve as a systems integrator. The biggest opportunities for 
startup firms in the computer industry today lie in the area of 
system integration. These 'systems Houses* can pick and choose the 
best hardware and software available and add value by making it work 
together predictably. 

The definition of computer system is changing. No longer can we 
measure only mips and 1/0 bandwidth to accurately project system 
performance. The new 'system* is not self contained, it uses 
personal computers, servers, various interconnects, and traditional 
processors (and clusters of processors). 

The successful vendor in this market will be able to accurately 
define their new 'system' and also provide: 

1, Hardware and software as listed above. 

2. Accurate performance characterizations for their 'system' 
(integrated with the gear of others perhaps?), 'System' 
performance should oe predictable. 'System' performance should 
be easy to monitor and tune. 



Clusters Analysis 



Page 7 



3. Services oriented towards tuning their hardware and software 
'system' (considering the gear of others perhaps?) towards the 
application goals of the user, 

4. Ongoing supoort and maintenance of the new 'system'. Operation 
of the 'system* defined by personal computers and local area 
networks must not oe perceived to be more complex than operating 
a simole time sharing system. Response times must be comparable 
with that of timesnaring systems, 

5. Consistent interconnects which allow new processing units and 
servers to be Integrated into the 'system' while protecting the 
user's current investment. 

Approximate Market Size: ? 

Digital is going hard after this market. 
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The Competition 



For this disussion we will limit the review of competition to the 
following: 

1. Tandem 

2. IBM 

3. The New rfave (other companies with interesting potential, markets, 
or technology) 

Further analysis will be made availaDle at a later time. 

1. Tandem Computers 

Tandem Computers was founded in 1974 to provide multi-computer 
systems (CLUSTERS) oriented towards transaction processing 
applications with critical uptime requirements. Tandem Computers 
will sell approximately S350 Million of such systems this year under 
the trademark 'Nonstop'. 

Despite the name 'Nonstop', Tandem is not marketing in the 'HIGH 
AVAILABILITY* market outlined earlier in this document. They have 
instead concentrated on the general purpose transaction process ina 
market. Tandem has begun to do some repositioning into the newly 
emerging office market. 

They are positioning office automation as a natural extension of 
Transaction Processing (which has been their forte). By adding 
compatible support of new data types through their newly announced 
'TRANSFER, TRANSFER /MAIL, and TRANSFER/FAX* software they are 
broadening their scope of applicability, Tandem announced 
intentions to pursue high soeed/low cost transmission of data via 
satellite through a joint venture with American Satellite 
Corporation with a product called 'INFOSAT'. 

Tandem has created some new issues for the competition to address. 
They played this game with 'Nonstop', positioning highly reliable 
operation as a primary requirement whether it was or not. 
Similarly, I expect them to make integration of all data types with 
satellite transmission, using distributed data base software, their 
new wedge into the account. Tandem believes that these new products 
will keep the competition on the defensive, while placing themselves 
in the Office Automation game. 

'NonStoo' was (and will remain) a key buzzword in Tandem competitive 
situtations. 'Distributed Data Base', 'Multiple Data Types', and 
'Satellite Transmission' will become the new buzzwords in Tandem 
competitive situations. 

Tandem believes that their Distributed Data Base is their biggest 
point of differentiation today. 
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Other Key points derived from a presentation made by Tandem's 
president: 

1. They view their competition as almost exclusively IBM (tftey 
explicitely stated this). Tandem states that I5 M has a 'strike 
force* to comoete with. them. They also state that IB* will 
withdraw a old rather than lose the decision. 

2. They believe that IBM has traditionaly gained account control by 
controlling the central DP facility with centralized DP, 
centralized data bases, and hierarchical networks such as SNA. 

3. Tandem wishes to garner account control through Nonstop 
Distriouted Data Processing, with Distributed Data Bases Cthey 
oelieve that their relational, distributed data base capability 
is the cornerstone of their entire system), and more flexible 
networking architecture, 

4. Tandem perceives themselves as an 'End User* oriented company. 
They are committed to supplying very high levels of support 
(given that ISM is their competitive target, they wish to have a 
similar image). Tandem believes that they should be considered 
a 'mainframe vendor* which provides tools more in step with 
todays data processing needs. 

5. They believe that tneir products have evolved as follows: 



Nonstop Capabilities (including Data Integrity) 
extending to: 

Networks (including X.25, LANS, gateways, Satellite) 
extending to: 

Distributed Data Base capabilities (including Data Integrity) 
extending to: 

Transaction Processing (layered on Distributed Data Base) 

extending data types to: 
Image (Facsimile, Xerox, Graphics, Video) 
Voice (Digitized) 
Text 
Binary 



6. 



Tandem's target market segments are: 
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1. Large Sanies and Financial institutions. 

2. Large Manufacturing Companies 

3. Travel 

4. Transportation 

5. Airlines 

6. Telecommunications 



2. IBM 

IBM realizes tne importance of tne general purpose computing market. 
They realize that tne CLUSTER apDroacn to computing provides some 
very real benefits, particularly in the area of reliable operation. 
Because of these realizations, IB* has intimated that they are 
working on failsafe architectural extensions for one of their 
mainstream product families. 

They oercelve, and rightly so, that all vendors wili have to improve 
tne reliability of their general purpose computing products. IBM 
has not indicated any large interest in tne 'HIGH AVAILABILITY* 
market outlined earlier in this document. j 

IBM has expressed great interest in the Emerging market defined by 
Office Automation and Personal Computers. Where lew's hardware and 
software may not be up to standards at this time, they can be 
expected to improve. 

The far more urgent threat from IBM is their potential to do very 
well in the systems .integration part of the game. Having always 
been a service oriented company, providing a security blanket, IBM 
will invest heavily in being able ,to characterize the performance of 
their gear. They win also offer comprehensive services to neio 
ensure that the expectations of the user are met (whether or not 
that mean's re-setting the user's expectations). 

Digital is in for a major battle with IBM in the emerging markets, 
we have never been on more of a collision course with IBM in our 
history. For this reason, we must be prepared to invest heavily in 
the systems analysis, characterization, and service aspects. 

3. The New Wave 
1. Stratus 

Stratus is a sirall startup company which has targeted the 
general purpose transaction processing market. They are using a 
different architectural approach than Tandem however. stratus 
is relying almost exclusively on hardware redundancy to provide 
continuous processing. 
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In contrast to Tandem, who seem inclined to use custom logic for 
their processing engines, Stratus is using standard 
microprocessors in totally redundant conf iourations (they are 
using the Motorola 68000), 

Because of the low cost of the microprocessors, Stratus sees fit 
to place two microprocessors in each processing unit with 
comparators to check for consistent results. In the event of an 
inconsistent result, failover takes place to a totally separate 
processing unit which also has dual microprocessors. 

By using this approach, stratus claims that invalid results will 
never get through the system. Thus they perceive no need for 
recovery software of any kind. Their's is a totally hardware 
oriented approach. 

Contrast with Tandem or Digital's future offerings, which are 
combination hardware and software approaches. 

This approach has great marketing appeal, It is easy for the 
customer to understand and easy to contrast with more complex 
approaches. However, the chip level redundancy should be viewed 
simply as an alternative way to implement error detection on 'a 
board. Our boards might be as reliable as theirs, but we have ;a 
more difficult time explaining how we do error detection. This 
is unfortunate, because our error detection is probably more 
comprehensive (since they only check on microprocessor failure). 

It win be interesting to see how well Stratus* approach is 
received in the market place. Also how well the hardware only 
solution provides continuous processing. The approach warrants 
watching. 



2. August Systems 



August Systems is a small vendor targeting the 'HIGH 
AVAILABILITY' type of application outlined earlier in this 
document. The 'real time' nature of these applications 
differentiate them from the more 'data processing' oriented 

applications which Tandem, Digital, and IBM are targeting, j 

j 

August Systems "Can't Pail" system uses triple-redundanjt 
microprocessor based logic, triplex process interfaces, arid 
peripherals that can be triplicated depending on applications 
needs. 



August Systems is mentioned here so that we can watch how well 
they perform in very high risk application segments. j 



3. Intel 



Stratus is using the the M68000 microprocessor and is building a 
multiprocessor architecture around it. 



Intel is in the process of introducing their 432 microprocessor 
Into the marketplace. Its first point of differentiation is 
that has a very high level, object oriented instruction set. 



Clusters Analysis 



Page 12 



Its second point of differentiation is that they have built the 
rnultiorocessor capabilities into the architecture, 

Intel claims that the higher level, object oriented system will 
reduce the incidence of software failure. They also claim that 
the multiprocessor architecture will provide hardware fault 
tolerance. 

They do have an Achilles Heel in the approach however. Their 
multiprocessor architecture uses shared memory with no 
provisions for memory subsystem failure. They do not provide 
automated methods to recover from component failures. They 
simply provide the right hooKs to have many processors executinc 
from a common ban* of memory, 

where the high level, object oriented system might reduce the 
incidence of applications software failure, it will be 
interesting to see if implementing high level functions in logic 
and microcode proves less susceptable to system 'software* 
failures than implementing these functions in the operating 
system. Powerful tools are available for debugging operating 
system code today. Comparable tools are not yet available for 
debugging microcode and logic. 

The Intel 432 architecture is worth watching, It does not pose 
an immediate threat in the re'liable computing space. It has 
more potential to provide a wide performance range of high 
level, object oriented processing engines. This range of 
processing engines could be put together using Clustering 
techniques to provide highly reliable computing systems. This 
fits with Intel's strategy to marfcet the 432 almost exclusively 
through OEM channels. 
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CI Clusters 



In 01FY84 Digital win offer a Cluster Architecture based on the CI. 
This section win summarize the Key components of the arcnitecture. 



1, Component Descriptions 



1. Computer Interconnect CCD 



The CI is a high speed (70 megabits/second), muitldropped, short 
distance (90 meter radius) interconnect designed to pass data 
and control information among intelligent computers. 



The computers currently supoortinq the CI are as follows: 



1. VAX-li/780 



2. VAX-H/782 



3. VAX-11/750 



4. 2060 



5. 2080 (JUPITER) 



6. VENUS 



7, HSC-50 (1/0 server) 



The CI Port Interfaces themselves are intelligent. The CI port 
Interface's have been designed to utilize the page tables and 
virtual addresses of the supported VAX systems, thus making bul'fc 
data transfers very efficient. Reliable transmission is 
guaranteed by protocols implemented in the port. 

A dual path facility has been built into the CI port 
architecture to provide for redundancy. Under normal operating 
conditions , the dual path facility can provide enhanced 
performance. 



2. I/O Server (HSC-50) 



The HSC-50 is an intelligent mass storage subsystem. When 
integrated into a CI Cluster, the HSC-50 is utilized as a common 
I/O Server for all nost computers residing within the CT 
Cluster. 



Each HSC-50 is counted as a node in a CI Cluster, The HSC-50 i:s 
a computer, one which has been optimized towards managina tne 
flow of information between large mass storage devices and one 
or more host computers. 

The HSC-50 relieves the host software of the burden of 
performance optimization, disk personality, and error recovery. 
The HSC-50 always presents 'logically perfect' volumes to th> 
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host computers. For sore acplications , the I/O Server will also 
maintain shadowed copies of selected disic volumes, fchen both 
shadow volumes are online, a performance benefit can be expected 
since the system will access data from both volumes. An 
additional access arm to the data is available. 

Plans are in place to provide utilities to perform volume 
backups from disK to taoe without host intervention, 

A bank of volumes can be dual ported between a pair of H5C-50s. 
The H5C-50s can share the I/O processing load (static dual 
porting only). If one HSC-50 in the pair should fail, the 
surviving unit can automatically restore service for volumes 
previously owned by the failed HSC-50, This failover takes? ^ 
place without loss of outstanding I/O requests. ^tuUtt. 

The HSC-50 is a special purpose computer optimized for servicing 
I/O from large mass storage devices. General purpose computers 
with traditional mass storage interfaces are inherently less 
efficient at this task, 

3, System Communications Architecture (SCA) 

The SCA Is a layer of software which implements the eauivalent 
of network functionality between computers within a Cluster. To 
understand the difference between SCA and DNA we must first 
study the major differences between a Network and a Cluster. 

1. Networks are usually geographically dispersed (although this 
is not necessary). 

The computers within a Cluster are usually co-located within 
the same facility. This is true since the primary 
motivations oehind implementing Clusters are to provide 
larger capacity computer systems and to provide redundancy 
within a computer system, 

2. Nodes within a Network are usually controlled and operated 
by several different organizations within a business entity. 

Computers within a Cluster are usually controlled and 
operated by the same organization within a business entity. 

3. To access a Data Base on a remote node within a network, the 
requesting node must be given positive authorization by the 
serving node. It has been said that nodes within a network 
are 'mutually suspicious'. 

Data Bases and other resources are considered to be shared 
eaually among ail computers within a Cluster, When an 
additional computer is added to a Cluster, it is considered 
to be equal partner sharing all resources with the other 
computers in the Cluster. The computers In a Cluster are 
"mutually benevolent*. 
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*ith current communications technology, routing techniques 
provide more flexible and lower cost network topologies. 

Communications technologies used to implement Clusters allow 
for full multidrop topologies. There is no need for routing 
functionality 'within Clusters. 

Communications across networks primarily takes Place between 
cooperating apolication processes. The system utilizes the 
network primarily to provide resource snaring functions. 

Communications within a Cluster takes oiace primarily 
between the member computer systems. Large data transfers 
between nosts and servers and resource contention control 
messages between cooperating hosts comprise the bulk of the 
traffic across the Cluster link. Of lesser magnitude are 
messages between cooperating processes on separate computers 
within the cluster. 



The System Communications Architecture CSCA) was developed to 
provide for highly efficient data flow between computers within 
a Cluster. The SCA provides the backbone transport mechanism 
for ail other cluster software. The efficiency provided by SCA 
is necessary to transform a group of independent computers into 
a cluster. 

4, Mass Storage Control Protocol CMSCP) 

A MSCP has been devised to allow for flexible connection of new 
mass storage devices to computer systems, with the advent of 
intelligent disk controllers such as the UOA and the H5C-50, it 
is now possible to implement disk drivers which can fie 
insensitive to changes in tne characteristics of the drives 
themselves, and also insensitive to changes in tne transport 
mechanism from drive to computer memory. j 

These new drivers are called Class Drivers. These Class Driverls 
implement the 'master' side of the MSCP. The intelligent 
controllers implement the 'server' or 'slave' side of the MSCP,! 

Operating systems can support new disk technology in a more 
timely fashion by using this class driver scheme. In addition, 
new transport mechanisms, such as the CT architecture, can be 
more easily leveraged. 

If a new interconnect is introduced it is now possible to 
support by simply writing a port driver interface to the new 
interconnect. If a new controller is introduced it is now 
supportable by simply writing the 'slave' side of the MSCP in 
the new controller. 

Because every 'master' reauest must be positively acknowedged by 
a 'slave' when usina mscp, it is possiole to cleanly implement 
device and controller failover in the system. I/O requests are 
never lost and can be retried in the reconfigured system in the 
event of a failure. 
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The 'slave' or 'server* side of the M.5CP has also been 
implemented on vax/Vms. This allows current VAX systems with 
local mass storage to be cleanly Integrated into CI Clusters 
with no loss of user investment. E-ach VAX system with local 
mass storage can act as a server thus ma<ing its storage 
transparently available to other VAX systems within the Cluster, 

5, Distributed Loclc Manager 

The Distributed Loclc Manager allows VAX/ VMS to Implement a true 
shared file system across a Cluster. The Loclc Manager is 
resident on each vax system within the cluster. 

An application process wishing to access a particular record 
within the shared data base of the Cluster makes a record loclc 
request to the Distributed Loclc Manager. Once the loclc reouest 
has been granted, no other application process on that computer 
or any other computer within the cluster can secure a lock on 
that particular record. 

The distributed implementation of the loclc manager Cand the disk 
ACPs) ensures that there is no single resource allocation 
bottleneck within the Cluster. Tradeoffs have been made in the 
lock manager to optimize for normal operations rather than 
failure recovery. The minimum amount of interprocessbr 
information is passed during normal operations. Enough 
information is passed to allow surviving computers in a cluster 
to deriv e the Iock Information of a failed computer. The 
surviving computers can then release loc<s held by applications 
which were executina on the failed computer. 

6. Common Journalling Facility CCJF) 

The CJF provides a series of system services which allow any 
Data Base Management System to create and maintain journals of 
data base activity. 

The CJF facilitates the creation of Before Image Journals which 
could allow data bases to be 'rolled baclc' to some Known, 
consistent state. 

The CJF facilitates the creation of After Image Journals which 
could be applied to Backup copies of the data base. This allows 
for 'roll forward' reconstruction of data bases destroyed by 
hardware failure or corrupted by software failure. 

The CJF also allows for applications to maintain user defined 
audit trails of data base or other system activity. 

The CJF allows for any number of Data Base Managers to share the 
same journal volumes. These journal volumes are normally 
magnetic tape, but can also be disic volumes. 

The participating Data Base Management Systems are responsible 
for providing the utilities which apply journalled data in 'roll 
baclc' or 'roll forward' recovery situations, These utilities 
are currently being written for RMS and DBWS, 
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7, Recovery Units 

The Recovery units facility allow Data Base Manaaement Systems 
to dynamicly maintain the consistency of their data bases in the 
face of transaction, or system failure. 

The Recovery Units facility provides two simole calls which 
allow applications to protect themselves from data corruption. 
The first call, normally invoked when the data base is in a 
Known, consistent state, creates a recovery unit, Once a 
recovery unit has been created, the system is directed to secure 
(normally on disk) 'before image' copies of data oase records 
affected by the transaction. The second call, normally invoiced 
when the transaction has been completed and the data base is 
once again in a consistent state, purges the recovery unit. 

If the transaction or system should fall while the recovery unit 
is open, the system will 'roll back' the effects of tne 
transaction, thus bringing the data base to the consistent state 
which existed at the beginning of the recovery unit. At that 
point, the transaction can be retried. 

In the case of an application failure or aborted transaction* 
the VAX system on which the transaction was running will do the 
'roll back*. In the case of a system failure, the surviving 
systems within the Cluster will 'roll back' all active recovery 
units opened by the tailed system, 

8. Checkpointing Facility 

A Checkpointing Facility is being provided to allow applications 
with a critical investment in processing to protect that 
investment. This is useful in two scenarios. The first is tne 
application which runs a single monolithic dob for long periods 
of time (usually hours). This type of application is typical In 
engineering and simulation applications. The second type is tne 
application which reguires that transactions be automatically 
retried (in the event of a failure) without additional operator 
interaction. In both cases the implementation is the same. 

The application defines a checkooint, usually at some consistent 
point in its execution. In the 'retry transaction' case, the 
checkpoint should be done immediately after ail transaction 
inputs have been received. At the time of the checkpoint, the 
system secures all altered pages, in the virtual address space 
of the process, to a checkpoint file. The application can then 
resume processing for some amount of time. In the event of a 
failure, which causes the application to abort, the system can 
reinitiate the application from the point of the last defined 
checkpoint by reconstructing the state of the process from the 
checkpoint file. Because the checkpoint was secured to disk, 
the application could conceivably be brought up on a different 
processor within the Cluster (given that the new processor is 
the same tyoe as the original processor - for example 780 -> 780 
). 



Clusters Analysis 



Page 18 



The checkpoint facility, in many cases is combined with the 
Recovery Units facility. In this case, the recovery unit is 
'rolled back* before the application is reinitiated from the 
checkpoint. The system will ensure that Recovery Units and 
Checkpoints are declared in a logical fashion when they are used 
in unison. 

9. Ulysses Communications Switch 

In previous sections the benefits of a Cluster were broken into 
to major categories. The first being the ability to add 
incremental processing capacity due, in most part, to the shared 
data base. The second benefit is the ability to provide 'sDare' 
processing capacity with automatic failover to surviving units. 

In order to fully realize the above benefits, there must be a 
facility to automatically switch terminals and communications 
lines from one computer within the cluster to another. To meet 
this reauirewent the Ulysses communications switch is being 
used. 

There are two major points of differentiation for the Ulysses 
switch veasus other similar switches. The first is the ability 
to concentrate lines and ports near their points of origin and 
use single high speed lines to the switch itself. The second 
point is that the switch is controlled primarily by software 
resident in the host computers within the cluster, with the 
flexibility of host control, some crude load leveling can be 
implemented. 

The Ulysses switcn can be configured in a fully redundant 
fashion. 

Further detail on the Ulysses switch win be made available at a 
later time. 



2. Possible Extensions 

The Cluster Architecture could gain more flexibility by implementing 
Disk Volume Shadowing on disks directly connected to the host 
computers (through the UDA). This would allow for Clusters with a 
lower entry price. Currently Disk Volume Shadowing is available 
only on the HSC-50 I/O Server, thus reguiring inclusion of an HSC-50 
in order to provide the highest levels of data integrity. 

The ability to perform Disk Volume Shadowing is also a capability 
useful outside the realm of Clustered Systems. Many single computer 
applications have stringent reguirements for protection of data, in 
many cases, loss of the computing service is not critical, but loss 
of data can be a disaster. In these cases, providing Disk Volume 
Shadowing exclusive of the Cluster Architecture and the HSC-50 I/O 
Server would be desirable. 



3. The Future 
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Trie. CI Cluster Architecture nas supplied Digital with some very 
significant benefits-. They are: 

1. The ability to provide survivable systems through extensions to 
the mainstream vax family. 

2. A nedae for high end systems. Except in the case where a very 
powerful compute engine is needed for monolithic compute jobs, 
the Cluster Architecture will allow us to cleanly increase the 
caoacity of multi-user systems while protecting the user's 
current investment. This allows us to' 'do the right thing' with 
technology at the high end by relieving the pressure to rush tne 
next high end engine out the door. 

3. The software tecnnologies employed have solved some of tne 
crucial problems associated with distributed data bases. 



Because of the above general benefits, we should commit to providing 
similar capabilities on future members of the VAX family (and 
beyond?). 

In tne near future, this means initiating projects to provide *CI 
liice' capabilities on SCORPIO and NAUTILUS. 
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Related Products 



The technologies explored and implemented in the CI Cluster Program are 
significant. The following programs nave potential to enhance the 
Cluster Architecture- These programs can also leverage the experience 
we have gained in tne design and implementation of CI Clusters. 

1. Local Area Networks (Nl) 

It is possible to orovide lower cost (perhaps less functional) 
clusters by substituting CI with ni. we should be able to properly 
characterize the potential of this aoproach. 

As we move towards more modular systems in the future it win be 
difficult to protect our user's investment with two similar 
interconnects. Communications technology might allow us to produce 
a Local Area Network interconnect which approaches the speed of the 
CI. If this occurs, will there be a need for two separate 
interconnects with different sets of servers for both? 

2. Personal Computers 

There is a requirement (If we intend to build 'systems* defined by 
personal computers and LANs) to provide for transparent data base 
access between Personal Computers, Servers, Networks, and general 
purpose computers. ; 

It would be desirable to extend the Mass Storage Control Protocol, 
Distributed Lock Manager, and related data base software, developed 
as part of the CI Cluster project to our Personal Computer Clusters. 
This may not be feasible until a 32 bit engine is available for our 
personal computers. Could we limit the scope enough to solve the 
problem with tne 16 bit engine of today? Some of the research and 
prototyoe efforts for smart caches, distributed forms, and 
distributed editors may be applicable here. 

3. Data Base Management 

The shared data base Droduced for CI Clusters has attacked many of 
the classic problems of the distributed data base. Is 
communications speed the only gating factor preventing us from 
having similar shared data bases across Networks? Pernaps the 
security issues are a major obstacle. 

True DATA BASE MACHINES (as constrasted with I/O and File Servers) 
should be considered carefully. 

4. VAX Information Architecture 

Tne Common Journalling Facility, Distributed Lock Manager, Recovery 
Units, and Checkpointing Facility provide a much more solid base for 
the VAX Information Architecture than exists today, we must ensure 
that the higher level components of VIA taice full advantage of these 
powerful new capabilities. 
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In addition, the Application Control Management System (ACMS) and 
tne Transaction Processina Development System CTPDS) , formally 
called TPS5, provide the nignest layers of VIA. The nign level 
Application Control facilities provided oy ACMS should ma<e Office 
applications isuch easier to conceive and implement. In addition 
there is potential for ACMS to maxe tne Cluster System Architecture 
more powerful by providing load balancing or job partitioning 
functions. we should !*ove agressively towards integrating thes ! e 
products cleanly into the Cluster System Architecture. 
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Impact of New Technology 



1, VLSI 

The obvious impact of VLSI is that it should provide better price 
performance in computers, servers, and communications ports. There 
will also be a trend towards putting higner level functions into the 
logic of the processors themselves. An example of this is the Intel 
432 discussed briefly earlier in this document, 

while striving for more reliable systems, the initial concentration 
has been on ma'King the hardware more reliable. Very little 
practical worx nas been done to make software Cor logic) more 
reliable. Hard failures are much easier to recover fro-rs tnan soft 
Cor semi-soft) errors. There are some interesting theories- in the 
area of software fault tolerance however. 

It may be possible in the future to imolement the higher level logic 
of a processor several different ways on a chip csince silicon area 
will not be at a premium). By applying success criterion to the 
operations, and providing facilities to bacic out non-successful 
operations, several different algorithms could.be tried, Peter Lee 
of Digital's Advanced System's Research Group is our resident expert 
in this area of concern, 

2, The Server Architecture 

As we move closer to system architectures comprised solely of 
personal computers and a complement of servers, we must successfully 
deal with the transition from the traditional 'host computer* 
architecture of the past. There is a tremendous desire to produce 
and deliver these new 'systems* today, but our investments in new 
processors Cthe ones with active, funded projects today) are 
considered more as follow ons to our traditional lines of computers. 
Thus, the transition from traditional computing to the' new Server 
Architecture becomes more difficult since we tend to lock our 
customers into the follow on traditional computers. 

One way to deal with this problem is to ensure that the 
interconnects for our traditional system Clusters and the 
interconnects for our Server Architecture 'systems' converge. in 
this way, the two approaches can more closely comolement one 
another. Additionally, we might start looking at future traditional 
processors as servers Ceven if they are considered high performance 
comoute servers). 

Servers will expected to oe very hiahly reliable nodes within the 
new system architecture. Perhaps chip or modular level redundancy 
Cala the stratus approach) should be studied for these critical 
components. 

3, Communications 
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CoffT.uniati.ons technology, particularly satellite, may allow us to 
cleanly migrate our Cluster Architecture functionality to more 
geographically dispersed topologies (Tandem is heading in this 
direction). What are the security implications? Encryption of 
satellite is prooaoly a must. 
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Issues 



The CI Clusters Architecture is a significant milestone in Digital's 
engineering history. Although the initial oesign center for the program 
was the 'HIGH AVIALABILITY * space, the program has provided 
contr ioutions in other areas. The program has been in place for seme 
time now, and products are due to be delivered in Q1FY84. The following 
is a list of issues which need to be addressed to ensure that the 
program is successful in the marketplace and to ensure that we are able 
to leveraoe technological advances produced by the CI Cluster Program in 
other strategic programs within Digital. 



1. Product Introduction and Promotion 



It is well understood what products will be delivered in Q1FY84. 
The introduction and promotion of this program should have very high 
priority, we cannot miss the opportunity to leverage the technical 
innovation produced by the CI Cluster Program. The CI Cluster 
Program and product announcements scheduled for G2FY83 and Q1FY&4 
should be treated in a fashion comparable to the Ethernet program 
announcement and ether VAX family announcements of recent years. 

Suggested action: Firm budgets for program and product 
announcements • 



2. System Characterization 



It is vitally important that we be able to properly characterize the 
performance of CI Clusters. The recent formation of a Systems 
oriented group within 32 bit engineering is a step in the rigfrt 
direction. The current level of funding for this group win ensure 
that the VAX-li/780 is thoroughly tested. Some additional 
maintainability tools will also be produced. 

More committment should be made to testing VAX-1 1/750 
configurations. 

Committments should be made to include measurements of CI Clusters 
in all applicable performance studies within Digital. 

Suggested action: Additional emphasis on Systems testing within our 
engineering organization. 



3. Services 



In order to ensure a smooth introduction of the CI Cluster Program 
into the marketplace, our Hardware and Software services 
organizations must have programs tailored to this new architecture. 



These programs should include: 



1. Remedial Software Support 
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2* Hardware field service support complementary to that provided on 
current single system offerings. It must be clarified how 
programs like the 'Guaranteed Uptime program* will relate to 
Clusters. 

3. Consulting Service offerings to allow customers to ta<e full 
advantaae of both the performance and redundancy benefits of tfte 
Cluster Architecture. Clusters win be more difficult to tune 
than the single computer systems we are selling today. These 
consulting services must be in place to ensure that the initial 
customers for CI Clusters are successful. 



service offerings should -be the delivery mechanism for the knowledge 
we acquire in our 'systems group' within engineering, 

Suggested action: Continued interaction with tne appropriate 
service organizations. I 

4. Interconnects 

The Cluster Architecture is built around the CI today. This is a 
good match. It is possible however, that the NI could also provide 
a reduced level of functionality within the Cluster Architecture. 
This opportunity should be explored. 

j 

we must now start thinking of a second generation of interconnects 
to provide follow ons to both the CI and NI. If the programs 
converged it might be easier to provide servers for a wider range of 
application needs. It might also be easier to protect our user's 
investment in equipment over time. If the programs do not converge 
it is possible that we will have a problem similar to the 
UNIBU5/0BUS on PDP-llS. 

Suggested action: Inclusion of NI into the Cluster Architecture. 
Research into a second generation of interconnects. 

5. The new 'system' 

It will be necessary to extend functionality now provided only 
within the context of the CI Cluster Architecture to the new 
'system' defined by personal computers and local area networks. How 
much can we leverage experience gained in the development of the CI 
Cluster Architecture? 

Suggested action: Formation of a new 'systems' group. 

6. Communications switching 

Although the Ulysses communications switch provides a flexible 
solution to most switching problems, it doesn't cover all the 
problems. 

There are no plans to failover DECNET links for example.. 
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Lower priced Clustered systems may need less aeneralized , lower cost 
communications switching. 

Suggested action: study alternatives to Ulysses for lower priced 
systems • 
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The Challenge 



Multi-computer systems, including those comprised of Personal Computers* 
are becoming more prevalent in the marketplace. Increased requirements 
for reliability are just one of the reasons for this trend, we can n© 
longer concentrate solely on engineering manufacturing, selling, and 
servicing single computer systems. 

In the future, there win be less differentiation between the hardware 
and software of various computer vendors. Today's innovation will be a 
commodity tomorrow. Producing quality hardware and software will always 
be important, Dut we can no longer survive in the emerging markets 
simply by producing the best hardware and software. 

The CI Cluster Program provides Digital with opportunity to gain 
valuable experience in the multi-computer space. 

we must set up a structure which allows us to become a leader in thje 
systems integration business. The vendor who is capable of prooerliy 
characterizing, installing, and servicing their multi-computer systenjs 
will lively be the vendor of choice. | 

The CI Cluster Architecture provides significant technical innovation 
for Digital, we should not lose sales based on the functionality of thi* 
CI Cluster. Neither should we expect to win sales based solely on tfi« 
functionality of the CI Cluster. Systems analysis and service 
capabilities tuned to the CI Cluster hardware and software will be of 
great importance. 

i 

The Challenge over the next year win be to ensure that we set up a 
structure to allow us fully leverage the Cluster Architecture. CI 
Clusters will be the first complete multi-computer systems Digital 
delivers to the market. This will allow us to gain valuable experience 
in the installation and servicing of multi-computer systems. This 
expertise will be necessary to survive in the newly emerging computing 
markets. 
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